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Abstract — Recently, lattice-reduction-aided detectors have 
been proposed for multiple-input multiple-output (MIMO) sys- 
tems to give performance with full diversity like maximum 
likelihood receiver, and yet with complexity similar to linear 
receivers. However, these lattice-reduction-aided detectors are 
based on the traditional LLL reduction algorithm that was 
originally introduced for reducing real lattice bases, in spite 
of the fact that the channel matrices are inherently complex- 
valued. In this paper, we introduce the complex LLL algorithm 
for direct application to reduce the basis of a complex lattice 
which is naturally defined by a complex-valued channel matrix. 
We prove that complex LLL reduction-aided detection can also 
achieve full diversity. Our analysis reveals that the new complex 
LLL algorithm can achieve a reduction in complexity of nearly 
50% over the traditional LLL algorithm, and this is confirmed 
by simulation. It is noteworthy that the complex LLL algorithm 
aforementioned has nearly the same bit-error-rate performance 
as the traditional LLL algorithm. 

Index Terms — lattice reduction, multiple-input multiple-output 
(MIMO), complexity reduction, complex-valued algorithm 



I. Introduction 

BY exploiting the linearity of a communication chan- 
nel and the lattice structure of the modulation, many 
detection problems can be interpreted as the problem of 
finding the closest lattice point. This lattice viewpoint of 
detection problems [l]-[3] forms the foundation of many 
low-complexity high-performance lattice-based detectors, such 
as Pohst's sphere decoder and various approximate lattice 
decoders (see [4] and the references therein). However, since 
the traditional lattice formulation is only directly applicable to 
a real-valued channel matrix, most conventional lattice-based 
detectors were derived based on the real-valued equivalent of 
the complex-valued channel matrix. This approach doubles the 
channel matrix dimension and may lead to an unnecessarily 
complicated detector. This insight suggests the possibility of 
deriving even simpler detectors for complex-valued channel 
matrices by introducing the complex lattice formulation. 

Recently, by exploiting the lattice structure of wireless 
multiple-antennas systems, lattice reduction is employed to 
improve the performance of MIMO detection [4]-[8] and 
precoding [9]. The most commonly used and practical lattice 
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reduction algorithm is the LLL reduction algorithm [10]. Its 
generalization, not just to the complex number field, but to 
the Euclidean ring in general, was introduced in [11]. In this 
paper, we investigate the implementation complexity and per- 
formance of of the complex LLL algorithm for the application 
of MIMO detection. From analytic and simulation results, it is 
shown that the average overall complexity of our accelerated 
complex LLL (CLLL) reduction algorithm is nearly half 
of that of the real LLL (RLLL) reduction algorithm. Like 
the RLLL algorithm, linear detectors employing the CLLL 
algorithm can achieve full diversity in lattice-reduction-aided 
decoding. Moreover, the bit-error-rate performance of MIMO 
detection schemes using CLLL-reduced basis is practically the 
same as that using RLLL-reduced basis. Thus, we achieve a 
reduction as large as 50% in the complexity of the reduction 
algorithm without sacrificing any performance. 

LLL reduction is often treated as part of preprocessing 
and hence its complexity is shared by symbols within the 
coherence time. However, in the situation where the channel 
matrix changes relatively rapidly, i.e. fast fading channel, the 
complexity of this preprocessing part becomes crucial. Our 
reduction algorithm makes it feasible to use reduction-aided 
detection schemes, which have much better performance than 
traditional schemes, even in fast fading channel. 

The rest of the paper is organized as follows. In Section 
II, the system model and its lattice viewpoint, along with 
the notations used throughout this paper, are given. The 
CLLL reduction algorithm and its complexity analysis are then 
described in Section III. Section IV proves the achievability of 
full diversity. In Section V, we present our simulation results. 
Finally, the paper is concluded in Section VI. 

II. Preliminaries 

A. System model 

Consider an n x m MIMO system consists of n transmitters 
and to receivers. The relationship between the transmitted 
column vector x and the received vector y is determined by 

y = Hx + w, (1) 

where H = [hi • • • h„] representing a flat-fading channel gain 
matrix, is a to x n complex matrix, all of its elements are 
independent complex Gaussian random variables CN(0, 1), 
and w is the additive noise vector, all of its elements are in- 
dependent complex random variable CN(0, 2a 2 ). We assume 
a full-rank MIMO system, i.e. H consists of n (< m) linearly 
independent vectors. The transmitted vector x is drawn from a 
finite set C, representing the complex constellation being used. 
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B. The complex lattice viewpoint 

A complex lattice A is the set of points 
c ihi : c i G S, hi G C m }, where is the set of 
complex integers Q = Z + iZ, i = \/— T. In the matrix 
form, A = {He : c e G n }, where H = [hi • ■ • h n ] represents 
a basis of the lattice A 1 . The lattice A can have infinitely 
many different bases other than H. In general, any matrix H' 
such that H' = HU, where U is an unimodular matrix (i.e. 
| detH| = 1 and all elements of H are complex integers), is 
also a basis of A. 

A lattice reduction algorithm is an algorithm that, given 
H, finds another basis H' which enjoys several "good" prop- 
erties. There are many definitions of lattice reduction, such 
as Minkowski reduction [12] and Korkine-Zolotareff (KZ) 
reduction [13]. (See, e.g.[14]-[17] for a modern description 
of them.) Among them, the most practically used is the LLL 
reduction algorithm (named after its inventors Lenstra, Lenstra 
and Lovasz [10]), whose running time is polynomial in the 
dimension of the lattice. 

The LLL reduction can be employed to improve the per- 
formance of MIMO detection schemes, assuming that the 
channel state information (i.e. the matrix H) is perfectly 
known at the receiver. Since the vector Hx can be viewed as a 
lattice point 2 , MIMO detection can be formulated as finding a 
reasonably close lattice point to Hx given y. By considering 
the reduced basis H' instead of H, it can be shown that the 
performance of traditional detection schemes like zero-forcing 
(ZF) and successive interference cancellation (SIC) can be 
greatly improved [4], [5]. Consider the MIMO system 

-l 
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For successive interference cancellation, we first apply QR 
decomposition to the reduced basis 



H' = QR, 



(3) 



such that Q is unitary and R is upper triangular. Then multiply 
the Hermitian of Q to y: Q H y and employ successive decision 
and cancellation [18] to obtain a hard-decision vector x'. 
Apply the unimodular transform to x' to obtain x = Ux' 
and finally, hard-limit x component-wise to a valid symbol 
vector x. Figure ^shows the block diagram of such a system. 
The performance is much better than performing successive 
interference cancellation detection on the original basis H. In 
fact, it has been proved that lattice-aided detection schemes 
using real lattice achieve full diversity [19], [20] . For more 
details on this lattice viewpoint, refer to [4], [5], [14]. 

C. Notation 

The following notation is used throughout this paper. Let z 
represent the complex conjugate of z. The conjugate transpose 
(Hermitian) of a matrix H is denoted by H H . Denote by H^ 
the Moore-Penrose pseudo-inverse of H. The inner product of 
two vectors hi and I12 is defined as (hi, h2) = hi. The set 

'Since the lattice basis is always arisen from the channel matrix, without 
the ambitguity, we use the same symbol H to denote both. 

2 To apply the lattice viewpoint, the constellation C has to be of lattice type, 
such as BPSK, QPSK, QAM or PAM. 



of orthogonal vectors generated by the Gram-Schmidt Orthog- 
onalization (GSO) procedure are represented as {h^, . . . , h* } 
which span the same space as {hi, . . . , h„}, and further let 

Hij = n-rejfn^ ■ The squared norm of h* is denoted as Hi = 

11 j 1 

||h*|| . 5R(z) and Q(z) denote the respective real and complex 
parts of a complex number z. Besides, [z] rounds z to the 
nearest integer, and if z is a complex number, it is done on both 
the real part and the complex part, i.e. \_z~\ — \jft(z)~\+il~s(z)~\ . 
For real number r, |r| denotes its absolute value. For complex 
number z, \z\ denotes its modulus: \z\ = \fz%. 

III. Complex LLL Reduction Algorithm 

In this section we describe a CLLL reduction algorithm. 
Traditionally, LLL reduction is performed on the real-valued 
equivalent matrix of the complex matrix H [1, pp.81] (c.f. [8], 
[9]): 

" »(H) -9(H) 
3(H) R(H) 



H, 



(4) 



Since the reduced basis matrix does not generally have the 
symmetric structure as in @, the detection part also has to 
be done in the real number field (c.f. [8]). This conversion, 
which causes a doubling in dimension, generally requires extra 
computations in the detection algorithm (during the processing 
phase) as well as the lattice reduction algorithm (during the 
preprocessing phase). 

Our algorithm works directly on complex lattices. Although 
the cost for each complex arithmetic operation is higher than 
its real counterpart, the total number of operations required 
for CLLL is fewer, leading to a lower overall complexity of 
CLLL. Moreover, the quality of CLLL-reduced basis is the 
same as that of the real LLL (RLLL) reduced basis. 

A. Principle of LLL reduction 

A basis H for a complex lattice is CLLL-reduced if both of 
the following conditions are satisfied: 



< 0.5 and |3?Oy)l < 0.5 
for 1 < j < i < n, and 

H k > (5- K, fc -i| 2 )H fc -i 



(5) 



(6) 



for 1 < k < n, where S with l/2<<5<lisa factor selected 
to achieve a good quality-complexity tradeoff [10]. 

Note that the value 6 = 1 can be used as well, but 
polynomial convergence of the algorithm is not guaranteed. 
If 71 = 2 and 6 — 1, then the basis has the properties 
|^(M2,i)| < 0.5, |S(M2,i)l < 0.5, and Hi < H 2 . This is 
exactly the complex Gaussian reduction presented by Yao and 
Wornell [5], For real lattices, it is well known that 2D RLLL 
reduction with 6 = 1 is equivalent to Gaussian reduction. 
We can see the parallel between real and complex lattice 
reductions. 

The CLLL reduction algorithm consists of 3 steps: 

1) A modified GSO procedure as in [4] to compute Hi', 

2) Size reduction process that aims to make basis vectors 
shorter and closer to orthogonal by asserting l|5} for all 

j < i; 
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3) Basis vectors swapping step. Two consecutive basis 
vectors hfc_i and will be swapped if (|6} is violated. 
The idea is that, after swapping, size reduction can be 
repeated to make basis vectors shorter. 
The two steps, size reduction and basis vectors swapping, 
iterate until ijfji is satisfied by all pairs of hfc_i and h^. The 
resultant basis is thus CLLL-reduced. 

Algorithm 1 gives the detailed description of the CLLL 
reduction algorithm. The algorithm also computes the uni- 
modular matrix U, which is required for our lattice-reduction- 
aided detection. This saves computational cost over explicitly 
calculating U = (Ht)H'. 

It is noteworthy that our algorithm has three distinctions 
from Napias [11]: 

• The condition (0 of size reduction is stronger than 
Ia%"| 2 < 0.5 given in [11], resulting in fewer size 
reduction computations. 

• The algorithm checks if \$t(/j, kik _i)\ > 0.5 or 

> 0.5 before doing size reduction, whereas 
[11] does not. The check will avoid unnecessary compu- 
tations and will improve the efficiency accordingly. 

• After swapping hfc_i and h^, Hk-i, Hk and some of 
the /ijj's needed to be updated to reflect the change. 
Instead of calling the GSO procedure again as in [11], 
the following updating formulas are executed to eliminate 
any unnecessary operations 





= h fc , 




(7) 




= hfe_i, 




(8) 








(9) 




fn k -i\ 

" M 1 u, )■ 




(10) 


Hk 
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(11) 
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Hk 
rtk-l 


, k < 


i < r{lT) 


ii-ik 


= Mi,fe-1 — MifcM&,fc-l) 


k < i 


<n,(13) 




= Hkj, l<3<k-2, 




(14) 




= fik-ij, 1 < j < k - 


2. 


(15) 



where h;, Hi and /iy denote the updated values of h;, 
Hi and /i^ respectively. 

The same algorithm can also be employed to reduce a real- 
valued lattice basis without any modification. Hence it can be 
viewed as a generalization of the traditional LLL algorithm. 

B. Complexity analysis 

The complexity of the LLL reduction algorithm depends on 
the distribution of the random basis matrix H. For simplicity, 
we assume that n = m, i.e. H is a square matrix. 

The speed of convergence can be determined by examining 
the product [10] (c.f. [4]) 

n i 

D = Y[d u di = n«j- (16) 

i=i j=i 



Algorithm 1. CLLL Reduction Algorithm 

Input: Lattice basis H = [hi ■ • • h„], factor 8 G 1). 
Output: CLLL-reduced basis H', unimodular matrix U = 
[ui • • • u n }. 

l: for j = 1 to n do 

2: H, ■ l.,.h. 

3: end for 

4: for j = 1 to n do 

5: for i = j + 1 to n do 

7: H, ■ H, //,,, 2 7^, 

8: end for 

9: end for 

10: U <- I n 

11: k <- 2 

12: while fc < n do 

13: if \U(fi k>k -i)\ > \ or |3?(M fe , fc _i)| > | then 

14: {H, /i} <- SIZE_REDUCE(H, fi,k,k- 1) 

15: end if 

16: if condition ^ is violated on k and k — 1 then 

17: swap and update using formulas (fTi- J 1 5i 

18: swap Ufc and Ufc_i 

19: k <— max(2, k - 1) 

20: else 

21: for j = fe — 2 to 1 step —1 do 

22: if \St(ji kj )\ > \ or \Q(ti kj )\ > \ then 

23: {H, fi} <- SIZE_REDUCE(H, M , fc, j) 

24: end if 

25: end for 

26: fc <— fe + 1 

27: end if 

28: end while 

29: return H as H', and U. 



D only changes when two basis vectors are swapped, i.e. 
H k < (S - \^k,k-i\ 2 )H k -\- After swapping, d { contracts by 
a factor of 6, while other d,'s are unchanged. As a result, 
D contracts by a factor of at least 5. Now it is clear that 
the parameter 8 dictates the speed of convergence. Hence, it 
is expected that CLLL and RLLL algorithms with the same 
value of 8 have a similar speed of convergence. 

Moreover, the norm B c of the longest column vector of H 
is equal to the norm B r of the longest column vector of H^, 
because 

B r = [5i(h)] 2 + [±3(h)] 2 = B c (17) 

which is clear from the form of in ®. Define B = 
B r = B c , it can be shown that the number of basis swapping 
performed for any H is 0(n 2 log B) [10] and the same applies 
to the case in which (|6} is satisfied. Since the whole algorithm 
starts at k = 2, and must terminate when k = n, the part 
between lines 21-25 must be executed for 0(n 2 \ogB) +n = 
0(n 2 logB) times. Therefore it can be easily seen that the 
complexity of LLL is actually dominated by this part, with 
overall complexity 0(n 4 logi?). 

To preliminarily estimate how much complexity can be 
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Algorithm 2. Subroutine SIZE.REDUCE 

Input: H = [hi • • • h„], p,, indices k, j, 1 < j < k < n. 
Output: H, /i. 



c «— 

u fc <- 
for / 



MM * 
end for 



- h fc - chj 

- U fc - CUj 

= 1 to j do 



saved by applying CLLL, compared with RLLL, we consider 
the CLLL-to-RLLL Complexity Ratio (CRCR) 



K 



n 4 log B 
(2n) 4 logB : 



(18) 



where K is an architecture-dependent factor meaning, on 
average, how many real arithmetic operations have to be exe- 
cuted per each complex operation. For example, if a complex 
addition requires 2 real arithmetic operations and a complex 
multiplication requires 6, then K = (6 + 2)/2 = 4 since the 
number of additions and multiplications are the same. 

However, there is one more factor affecting the complexity 
that we need to consider as well. Line 22 introduces a 
conditional test such that the execution of the subroutine 
SIZE_REDUCE at line 23 may be skipped sometimes. Denote 
P c (P r ) as the probability that this test is passed in CLLL 
(RLLL): 

P r (2n) = P{|Mfc, fc -i(2n)| >0.5}, Mfc,fc-iO) real (19) 
and 

P c (n) =P{|R[w k)fc _i(n)] | > 0.5 or |3[ MM _i(n)]| > 0.5} , 

(20) 

where, for clarity, the dependence on the dimension is shown 
explicitly. Then the CRCR dl 81 should be rewritten as: 



K 



n 2 logP • P c (n) • n 2 _K P c (n) 
(2n) 2 logP- P,.(2n) • (2nf ~ 16P r (2n) 



(21) 



By definition of /ife.fc-i, the random variables 
(n)],3[/ifc lfe _i(n)], and real-valued /i /c ^_i(2n) 
should have similar statistics. Moreover, result of our 
empirical studies, as shown in Table U for n < 22, indicated 
that both P r (2n) and P c (n) are small, decreasing with n. It 
is reasonable to assume that the event |5R(/ifc ,fc-i)| > 0.5 and 
\^s(fik,k~i)\ > 0.5 are statistically independent for circular 
symmetric complex Gaussian H. Then 

P c (n) « P {\U[p kik -i(n)}\ >0.5} 
+P{|9f[/i fc , fc _ 1 (n)]| >0.5} 
= 2P{|»^fc )fc _i(n)]| >0.5} 
= 2P r (2n). (22) 

Substituting this into (121 > and using the common value if = 4, 
we have 

CRCR « 4/16 x 2 = 1/2 (23) 



which means that CLLL algorithm will have half of the 
complexity of RLLL algorithm. Empirical results to be pre- 
sented will confirm the above prediction of 50% complexity 
reduction. 

Finally, it is important to note the implication of the complex 
lattice approach on the complexity of other components in 
lattice-reduction-aided detectors. If ZF or SIC is used in 
MIMO detection, the computation of the pseudo-inverse or 
the QR decomposition of the reduced channel matrix should 
be considered as part of the preprocessing as well. 

Since the RLLL-reduced basis matrix does not necessarily 
have a symmetric structure as (0), the pseudo-inverse or QR 
decomposition must be computed in the real number field. 
This, too, induces an extra complexity cost, as both pseudo- 
inverse and QR decomposition require 0(n 3 ) field operations 
[21]. By employing the CLLL reduction, hence avoiding the 
doubling of dimension, only if/2 3 = 1/2 for if = 4 of 
operations are needed. 

The processing part, on the other hand, requires 0(n 2 ) 
field operations. When K = 4, the complexity reduction 
obtained by avoiding dimension doubling is cancelled by the 
extra number of flops required for complex arithmetics. If 
more complicated schemes which require f2(n 2+ ^), £ > 
field operations (e.g. V-BLAST [22]) are used, the complexity 
may also be reduced in the processing part by performing 
computations in in the complex number field. 

IV. Full diversity 

Computer simulation always sees that lattice-reduction- 
aided detection achieves the full diversity of a MIMO fading 
channel (e.g. [4], [5], [8], [9]). For the case of real lattice 
and 5 = 3/4, the achievability has been proved in [19]. 
[20, Appendix] not only showed the full diversity, but also 
determined the gap between maximum likelihood detection 
(MLD) and LLL reduction-aided decoding. A systematic ap- 
proach was developed in [23] to obtain better upper bounds 
on the gap. To this end, [23] introduced proximity factors 
to measure the performance gap to MLD. It was shown that 
there exist constant upper bounds on the proximity factors for 
RLLL reduction. In this Section, we shall extend the analysis 
to CLLL reduction. 

Consider a fixed but arbitrary n-D complex lattice A. The 
decision regions of ZF and SIC have 2n faces. We only have 
to study n distances due to symmetry. The i-th distance of ZF 
is c?i,zF = 1 1 hi 1 1 sin^, for i = 1, . . . ,n, where 9i denotes the 
acute angle between and the linear space spanned by the 
other ri—l basis vectors hi, . . . , hj_i, hj+i, . . . , h„. For the 
SIC detector, the i-th distance is given by ||h*||. 

If the boundary effects are ignored, the minimum distance of 
ML detection is obviously duL — A(A), where A(A) denotes 
the length of the shortest vector in A. Following [23], we 
define the proximity factors for CLLL reduction as 



c A 
ft.ZF = SUp ■ 



A 2 (A) 



IhJPsin 2 ! 



c a A2 ( A ) 

Pj.SIC - SU P || h *|| 2 



sup ■ 



A 2 (A) 
Hi ' 



(24) 
(25) 
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where the supremum is taken over the CLLL-reduced bases 
H of all n-D complex lattices. Furthermore, we define p% F = 
maxi<i< n p^ ZF and p!? IC = maxi< 4 <„ pf slc , which quan- 
tify the worst-case loss in the minimum squared Euclidean 
distance. 

Using a union-bound argument [23], the error probability 
of ZF detection can be bounded as 

P e ,z F (SNR) < ^,ml(™) < <ml(^) (26) 

i—l Pi.Z¥ PZF 

for arbitrary SNR and A. A similar bound exists for SIC 
detection if boundary errors are ignored. The relation i26\ 
remains valid after averaging over H. Obviously the error rate 
curve of lattice-reduction-aided detection only show a shift 
in SNR, up to a multiplicative factor n. It is clear that it 
can achieve full diversity, since we know MLD achieve full 
diversity. The proximity factors measure the performance gap 
between MLD and lattice-reduction-aided detection at high 
SNR. 

In the following, we derive upper bounds on the proximity 
factors for CLLL reduction. The bounds are not necessarily 
the tightest possible; the main purpose here is to show the 
existence of constant bounds so that full diversity can be 
proven. 

A. CLLL-SIC 

By definition (|6j we have 

Hi>{5- l/v-iHW;-! >{8- l/2)Hi- U (27) 

as l/i^i-il 2 < 1/2 for CLLL reduction. By induction we can 
see 

Hj < of-'Hi, l<j<i<n, (28) 

where a = 1/(8 - 1/2) > 2. Substituting this into 

i-l 

INI 2 = «i + X)lWj| a Wi (29) 
due to the Gram-Schmidt orthogonalization, we obtain 

Hh.ll 2 < {l + ^a^/^Hi (30) 
/ 1 a 1 — a 



Hi 



(31) 



V 2 a- 1 
Replacing i by j and substituting d28l . we have 

for 1 < j < i < n. Since A (A) < ||hj|| and (03 holds for 
a ll j < it the l° ss m the i-th squared Euclidean distance is 
bounded by 



(32) 



r ( 1 a? — a 

Pi,SlC < ™n. 1 + t,— — 7- 



a 



i-3 



(33) 



To obtain an exponential upper bound, we show in the Ap- 
pendix that 



for a > 2. Note that fl33i is a better upper bound in general, 
though. 

The inequality J34> implies the bound 



and 



(35) 



(36) 



In particular, if n = 2 and 6 = 1 (a = 2), then pg IC < 2. This 
agrees with the maximum loss of 3 dB for the 2-D complex 
Gaussian reduction of Yao and Wornell [5]. 

Let us compare with the 2n-D RLLL reduction [23] 



Piic < 



(3 ={5- 1/4)" 



(37) 



If —1 in the exponent of a and j3 in d36i and d37t is ignored, 
the ratio of the two factors is 



Psic „ P 2n (S-1/2Y 



pg c a" [5-1/A)^- 
For the common choice (5 = 3/4, we have 



Psic „ (1/4)" 
Pgc ~ (1/2) 2 " 



1. 



(38) 



(39) 



which means RLLL and CLLL reduction have equal proximity 
factors, regardless of the dimension n. For any other value of 
5, we have pg IC > p§i C , because 

(5- 1/4) 2 > 6- 1/2 (40) 

where the equality holds if and only if 5 = 3/4. 



B. CLLL-ZF 

We need the following lemma to establish the upper bound 
for ZF. 

Lemma 1: For CLLL-reduced basis, 



sin.0,- > 



(41) 



,2 + V2y 

Lemma 1 extends Babai's lower bound on Q, L [24] to CLLL 
reduction. The proof basically follows Babai and is given 
in the Appendix. Lemma 1 along with the trivial inequality 
1 1 h,; 1 1 > A (A) leads to 



c 1 / 3 + 2^/2 



n 



n-l 



(42) 



|hj|| 2 < a^Hi, l<j<i<n 



(34) 



Since the upper bound is maximized when i = 1, we have 

A<(^)""' <43) 

Comparing (I43> and d36l >. we see that the constant (3+2v2) /2 
represents the inferiority of ZF to SIC. 

When n = 2 and 8 = 1, sin6»i > 2/(2 + >/2) and sin6»2 > 
V2/2. However, it is quite obvious that Q\ must be equal to 
6*2 here (this implies that the lower bound on 61 is not tight.) 
Hence p% F < 2, which again agrees with the 3 dB loss of 
complex Gaussian reduction [5], 
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Let us compare with the 2n-D RLLL reduction [23] 

, /<! r 

PzV< f 



(44) 



Again, ignore — 1 in the exponent of a and (3 in ( I43> and J44i . 
the ratio is 



(9/4) 2 (3-1/2) 
L(3 + 2V2)/2 (5-1/4) 2 
(5-1/2) 



1.74 x 



(45) 



(5- 1/4) 2 

The right-hand side of J45i is greater than one if <5 > 0.58, 
and is less than one otherwise. 

C. Properties of CLLL-reduced basis 

Without proof we give other properties of CLLL-reduced 
basis which are similar to those of RLLL-reduced basis [10]: 



d(A) < IJlNI < a n(n_1)/ *d(A), 

i=l 

||hi|| < a^-V^diA) 1 ^, 
o 1 - n A i <||hill 2 <a n - 1 A i 



(46) 

(47) 
(48) 



where d(A) = ^dct(H H H) for a basis H of lattice A, and 
Aj is the i-th successive minimum of A. These properties show 
in various senses that the vectors of a CLLL-reduced basis are 
not too long. 

V. Simulation results 

In this section we compare the average complexity and the 
error-rate performance when the reduced bases were used in 
MIMO detection. 

The average complexity was measured in terms of the 
average number of floating-point operations (flops) used. Sim- 
ulations were performed in Matlab, in which the number of 
flops equals 2 for complex addition and 6 for multiplication. 
For real numbers, both addition and multiplication require 
1 flop. Moreover, we assumed the costs of rounding and 
hard-limiting are negligible when compared to floating-point 
addition and multiplication. The LLL factor 5 was set to 0.99 
in all cases for the best performance. 

We demonstrate the average complexity of LLL-reduction- 
aided successive interference cancellation (LLL-SIC) detection 
scheme. The whole detection process can be divided into two 
parts: preprocessing and processing. The preprocessing part 
includes these operations: 

« Lattice reduction of channel matrix H. 

« QR decomposition of the reduced channel matrix H'. 
And the processing part includes: 

• Matrix multiplication Q H y. 

• Successive nulling and cancellation. 

• Unimodular transformation Ux\ where x' is the vector 
obtained by successive nulling and cancellation; and hard- 
limiting of the resultant vector to a valid modulation 
symbol vector. 



Note that QR decomposition can be replaced by GSO, which 
decompose H' into G consists of orthogonal column vectors 
and upper triangular matrix M T of all fiij. One might won- 
der if some speed could be gained by actually keeping the 
whole vectors h* (instead of just their squared norms Hi) 
in our LLL reduction algorithm such that G could also be 
obtained after the reduction. However, by doing so, many extra 
flops are required for updating after basis vectors swapping. 
Asymptotically, the complexity of the LLL algorithm would 
be 0(n 5 \ogB) instead of 0(n 4 logi3). Our simulation result 
also shows that, even in low dimensional systems, this "inte- 
grated" approach costs more than computing a "standalone" 
QR decomposition on the reduced basis afterward. Therefore 
we abandon this integrated approach. 

Table |n| shows the average complexity of the preprocessing 
and processing part of LLL-SIC. It can be seen that, by 
working on the complex lattice, the complexity of the entire 
preprocessing part was reduced by 45.1% for n = 2 (i.e. a 
2-transmitter-2-receiver system), and reduced by somewhere 
between 42.4% to 49.2% for larger n. 

In particular, the complexity reduction of the proposed 
CLLL reduction algorithm over the traditional RLLL is about 
44.2% to 50.5% for our selected range of n. 

About 40.4% to 47.1% of the computation was saved by 
computing QR decomposition in the complex number field. If 
we also assume the number of complex additions are roughly 
the same as the number of complex multiplications for this 
part, 4 flops are required for each complex number operation 
on average. Thus, we expect the complexity reduction of this 
part approaches 4/8 = 50% for sufficiently large n. 

The vector-error-rate (VER) and bit-error-rate (BER) per- 
formance when traditional RLLL-reduced and CLLL-reduced 
basis are used in MIMO detection are shown in Figures [2] 
and |51 respectively. The VER is the probability that at least 
one symbol in the transmitted vector are incorrectly detected. 
The MIMO system under consideration is a 6-transmitter-6- 
receiver uncoded system using 64-QAM. The system con- 
figuration is to maximize the multiplexing gain. Namely, 
each transmitter transmits its own symbol stream, independent 
other transmitters' data streams. The lattice-reduction-aided 
detection schemes being examined are SIC and ZF. For the 
sake of comparison, the performance of ZF, SIC, V-BLAST 
and MLD detectors are also shown. 

Both Figures and show that the CLLL and RLLL 
reduction algorithms result in practically identical VER and 
BER performance in MIMO detection, although the bounding 
values of their respective proximity factors are unequal in 
general. This can be explained as follows. 

• The proximity factors only quantify the worst-case losses, 
whereas the average error rate matters in fading channels. 
The worst case may occur with very low probability so 
that the average loss can be similar. 

• The bounds on the proximity factors are likely to be not 
tight enough. It is well known that the LLL reduction 
performs better than the theoretic exponential bound in 
practice. 

Nonetheless, it is clear that full diversity of the detectors is 
correctly predicted by the analysis in Section IV. 
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VI. Concluding Remarks 



This in turn requires 



In this paper, we extended the traditional LLL algorithm for 
reducing complex lattices. The resultant complex LLL algo- 
rithm was applied to complex-lattice-reduction-aided MIMO 
detection. We derived constant upper bounds on the proximity 
factors, thereby proving the achievability of full diversity. We 
showed that the complexity of the complex LLL algorithm is 
nearly half of that of the traditional algorithm. The complex 
LLL algorithm can achieve an average complexity saving 
of nearly 50% with negligible performance loss. Apart from 
MIMO detection, CLLL reduction can also find applications 
in the design of complex lattice codes [25], [26]. 



Appendix 



A. Proof of Ql 
We need to prove 



1 + 77 



1 a J — a 



2 a- 1 



(49) 



or equivalently 



x J 1 > a — 1 + -^{oP — a) 

1 i 1 

= —or H a — 1 

2 2 



(50) 



for a > 2. The inequality holds obviously when j = 1. We 
prove the rest by induction. Suppose that it holds when j = k; 
when j = k + 1 



a k+1 - a k = a(a k - a 1 " 1 ) 

( 1 k 1 
> a -a + -a - 1 
V 2 2 



= -a k+1 + a ( -a - 1 



~ 2 2 

where the second inequality follows from a > 2 and a/2— 1 > 
0. □ 



B. Proof of MB 

Following Babai's method [24], let a,; = —1 and at for 
t^i, l<t<nbe arbitrary complex numbers, and define 

n n 

7j = atHtj = dj + a t [itj- (51) 

t=j t=j+l 



We shall prove that 



E 

3=i 



7,| 2 > 



2 + V2 



2(n-i) 



(52) 



if ImwI — V2/2 for 1 < j < I < n. This can be proved by 
assuming the contrary, which leads to 



\7i\<e± 



2 + V2 



i<3 <n. (53) 



'2 + V2 



n—j 



i < j < n. 



(54) 



This is valid for j = n, since \a n \ = \j n \ < e by (15 li . For 
j < n we have 



t =J +i 

Using reverse induction, 




< E Ia t | V2/2. (55) 

t=j+i 



2 + \/2\" * \/2 



'2 + V2 N 



(56) 



Hence (I54> is proven. Letting j = i, we have |a;| < 1, but 
this contradicts a, = — 1. Thus d52i must be valid. 
Once again, following Babai [24] we have 



sin 2 Qi > |7j 



1 2 



(57) 



Then, excluding j < i terms from d57i yields a lower bound 

K; (58) 



sin 2 6>j > hj 



3=1 



2 ' L 3 

llhdl 2 ' 



Substituting (^8) and (03 yields, 



> 



> 



a 1 - 1 ■ n % 

, ,2(n-i) . 
2 ^ a »-n 



2+^2 



.2 + V2 

and this completes the proof of i41i . 
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Fig. 2. The VER performance of various LLL reduction-aided MIMO 
detectors in a 6 X 6 uncoded MIMO system using 64-QAM. 
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Fig. 3. The BER performance of various LLL reduction-aided MIMO 
detectors in a 6 X 6 uncoded MIMO system using 64-QAM. 



TABLE II 

Average complexity (in flops), assuming 2 flops per complex addition and 6 flops per complex multiplication. 
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Lattice reduction (5 = 


= 0.99) 
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RLLL 
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